MULTI-TIME SCALES IN ADAPTIVE DYNAMICS: MICROSCOPIC 
INTERPRETATION OF A TRAIT SUBSTITUTION TREE MODEL 
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Abstract. We consider a fitness-structured population model with competition and 
migration between nearest neighbors. Under a combination of large population and rare 
migration limits we are particularly interested in the asymptotic behavior of the total 
population partition on supporting trait sites. For the population without mutation on 
a finite trait space we obtain the equilibrium configuration and characterize the right 
time scale for fixation. For the model with mutation on an infinite trait space a jump 
process-trait substitution tree model is established on a rarer mutation time scale against 
the rare migration constrained in terms of a large population limit. Due to a change of 
the fitness landscape provoked by a new mutant, some temporarily unfit types can be 
recovered from time to time. In the end we shed light to illustrate sexual reproduction 
in a diploid population on the genetic level. 



In recent years a spatially structured population with migration (dispersion) and local 
regulation, proposed by Bolker and Pacala [I], Dieckmann and Law [17J (in short BPDL 
process), has attracted particular interest both from biologists and mathematicians. It 
has several advantages over general branching processes, which make it more natural as 
population models: the quadratic competition term is used to prevent the population size 
from escaping to infinity and the migration term is used to transport the population mass 
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from one colony to unoccupied colonies for survival, and to further get colonized. There 
are mainly two highlights of related papers. For instance, Etheridge Fournier and 
Meleard '1 i|. Hutzenthaler and Wakolbinger [IE] , have studied the extinction and survival 
problems. Champagnat [3J, Champagnat and Lambert [?], Champagnat and Meleard [SJ, 
Meleard and Tran [IB], Dawson and Greven [S] focus more on its long time behavior by 
multi-scale analysis methods. 

The main ingredient behind this model is logistic branching random walks, that is, a 
combination of logistic branching populations with spatial random walks (or migration) 
on trait sites. Under a combination of a large population and rare mutation limits, a 
so-called trait substitution sequence model (in short TSS) is derived in [3j. The heuristics 
leading to the TSS model is based on the biological assumptions of large population and 
rare mutation, and on another assumption that no two different types of individuals can 
coexist on a long time scale: the selective competition eliminates one of them. On the 
one hand, coexistence and diversity after entering of new mutants are not allowed due to 
the deficient spatial structure. On the other hand, natural selection is not only limited 
to competition mechanism but also is often combined with a survival strategy-migration 
mechanism. In spite of this heuristics, this model is still lack of a rigorous mathematical 
basis. 

The adaptive-dynamics approach is controversially debated since it was criticized only 
feasible in the context of phenotypic approach. However, the link with its corresponding 
genetic insight has rarely been developed (see Eshel [HI])- As far as sexual reproduction 
is concerned, population genetic models have dominated for many years since they have 
been proved powerful to model diploid populations on the genetic level. For a finite gene 
pool of fixed size, main evolutionary mechanisms like mutation, selection, and reproduc- 
tion are theoretically tractable though they can take a role in a very complicated way 
especially after sexual reproduction gets involved. The effect of sexual reproduction is 
more complicated to characterize mostly because random shuffling of genes may create 
many genotypes for natural selection to act on, which makes mathematical analysis more 
difficult (e.g., see [7] for the case with three genotypes combined by two alleles). In con- 
trast, adaptive-dynamics approach is mainly concerned with the long-term evolutionary 
property but usually ignores the genetic complications. Is there a way to embody features 
such as sexual reproduction arising on the genetic level but at the same time in which one 
can study its long term behavior via the quantitative trait method, i.e., taking advantage 
of adaptive-dynamics approach on the phenotypic level? This is the biological motivation 
of this paper. 

In this paper we propose a new model to justify the above arguments. We introduce a 
spatial migration mechanism among possible genotypes, which can be viewed as a result 
of fusion of any pair of alleles out of a fixed finite allele pool of a diploid population. 
After natural selection acting on a short-term evolution time scale, the population can 
attain an equilibrium configuration according to the fitness landscape. Each time there 
enters a new mutant gene (allele) into the gene pool, the genotype space is enlarged due 
to formation of new genotypes, and the spatial migration can be used to characterize the 
reshuffling procedure on the way to a new equilibrium configuration. Loosely speaking, 
the spatial movement is used to compensate the simplicity of genetic reproduction in 
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adaptive-dynamics approach. The critical point we need to take care of is to distinguish 
these different time scales after introducing fitness spatial structure in the model. 

The novelty of this model differs from previous models in three key aspects. Firstly, 
no genetic information is lost on any time scale. Some genotype containing a specific 
deleterious allele may be invisible due to its temporarily low fitness on the migration time 
scale, but it can recover on a longer mutation time scale due to the entering of a new 
mutant allele and the reshuffling of the genotype space. For example, some epidemic 
virus may become popular periodically because of a change of its mutated genetic struc- 
ture or a genetic change of its potential carrier. Secondly, thanks to the fitness spatial 
structure endowed on finite genotype space, coexistence is allowed under the assumption 
of nearest-neighbor competition and migration. This distinguishes our model from the 
classical adaptive-dynamics, which often converges to a monomorphic equilibrium. What 
is more, we derive a well-defined branching tree structure in the limiting system, which 
is like a spatial version of the Galton- Watson branching process. Last but not least, the 
idea of introducing the spatial migration to interpret sexual reproduction can provide a 
link between adaptive-dynamics and its genetics counterpart. In particular, similar con- 
sideration can be done to quantify more complicated sexual reproduction model than our 
toy case from the genetics side by mapping it to spatial migration model (see Section [5]). 

As a reminder, we want to mention some recent progress in the interacting fields of 
adaptive dynamics and genetics. Champagnat and Meleard [5] relax the assumption of 
non-coexistence condition in [3] and obtain a polymorphic evolution sequence (PES) as a 
generalization of the TSS model, allowing coexistence of several traits in the population. 
However, still unfit allelic traits can be excluded from the evolutionary history and may 
never recover, depending on the Jacobian matrices of Lotka-Volterra systems. Recently, 
Collet, Meleard and Metz [7] consider a diploid population model with sexual reproduc- 
tion, and obtain that population behaves on the mutation time scale as a jump process 
moving between homozygous states (genotypes comprising of a pair of identical alleles). 
Although their model puts a rigorous basis on Mendelian diploids, as mentioned in [TJ 
Section 6], it is still under the restriction of an unstructured population and single locus 
genetics. Evans et al. [HI Q2] study a continuous time evolving distribution of genotypes 
called mutation-selection balance model where recombination acts on a faster time scale 
than mutation and selection. The intuition behind their asymptotic result is that the mu- 
tation preserves the Poisson property whereas selection and recombination respectively 
drive the population distribution away from and toward Poisson. If all three processes are 
operating together, one expects that recombination mechanism disappears in the limiting 
system. This in some sense motivates us to specify migration in adaptive-dynamics to 
express one kind of genetic reshuffling like recombination in Evan's model. And migration 
should act on some well-defined fitness structure. 

In this paper we are interested in the case when the migrant event is rare with respect 
to branching events but not that rare as in [3| (see Figure [T]). In contrast, we assume that 
there are infinite migrants from a resident population on the natural time scale. Let a 
parameter e be the migration rate and K be proportional to the initial population size. We 
will impose the rare migration constraint 1 <C Ke <C K on the population (see parameter 
region II in Figure Q. As far finite-trait dynamic system is concerned, to find out the 
exact fixation time scale expressed in terms of the migration rate and population size is of 
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particular interest for us. Since the original model is not easy for us to study due to the 
complicated interactions, we present here a slightly modified model of the one in [3] but 
retaining the essential machinery founded in the original model. This paper is restricted 
with nearest-neighbor competitions and migrations along the monotone fitness landscape. 
What is more, in order to study the long time behavior, we introduce mutations to drive 
the population to move forwards to more fitter configuration on a rare mutation time 
scale, which is longer than the fixation time scale. Note that the limit theorem arising in 
[3] can be applied consistently in the model developed in this paper. 

The purpose of this paper and the accompanying one [2] is to justify a trait substi- 
tution tree process (in short TST) to illustrate the coexistence phenomenon with spatial 
structure in evolution theory, which is a purely atomic finite measure-valued process. The 
present one is derived from the microscopic point of view while the other one [2] is from 
the macroscopic point of view. Combining these two papers together with [21 E] , the entire 
framework on (rare) migration against (large) population limit can be fully characterized, 
and it results in different rescaling limits, TSS and TST respectively on different time 
scales. In summary, the entire framework is as follows: 

• Take large population and rare migration simultaneously by Ke <C ^Jci ^ leads 
to a TSS limit in [3]. 

• Firstly let K — > oo, then add rare mutation by In - <C - as e — > 0, it leads to a 
TST limit in 0. 

• Take large population, rare migration and even rarer mutation all simultaneously 
constrained by 1 Ke <C K, In - <C . That is our goal in this paper. 




K— >oo 



FIGURE 1. Division of parameter region: migration rate e against population size K 



The remainder of the paper is structured as follows. In Section [2j we present a 
description of the individual-based model. In Section [3| we consider the case without 
mutation but on a finite trait space, and characterize the rare migration limit against the 
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large population limit. In Section [4], concerning a modified population supported on an 
infinite trait space by introducing mutations, we justify a so-called trait substitution tree 
processes in the rare mutation limit, which also appeared in J2]. In section [5j we apply 
the previous results to a diploid population. In the last section, related proofs for results 
in previous sections are provided. 



2. Microscopic model 

We begin with a description of an individual-based model. Assume that the pop- 
ulation at time t is composed of a finite number I t individuals characterized by their 
phenotypic traits xi(t), . . . ,xi t (t) belonging a compact subset X of lR d . We denote by 
M.f(X) the set of non-negative finite measures on X. Let Ai a (X) C M.f(X) be the set 
of counting measures on X: 

J2S Xi :x u ...,x n eX,neN 

Then, the population process at time t can be represented as: 

h 

i=l 

Let B(X) denote the totality of functions on X which are bounded and measurable. For 
any / G B(X), v G A4f(X), we use notation (z/, f) = f fdv. 

Let's specify the population process (z^)t>o by introducing a sequence of demographic 
parameters: 

• b(x) is the birth rate from an individual with trait x. 

• d(x) is the death rate of an individual with trait x because of "aging" . 

• a(x, y) is the competition kernel felt by some individual with trait x from another 
individual with trait y. 

• m(x, dy) is the migration law of an individual from trait site x to site y. 

• fi(x) is the mutation rate of an individual with trait x. 

• p(x, dh) is the law of mutant variation h = y — x between a mutant y and its 
resident trait x. Since the mutant trait y = x + h should belong to X, this law 
has its support in X — x := {y — x : y G X} C M. d . 

To specify the model without mutation mechanism, the infinitesimal generator of the 
A4 a {X)-vaiued process is given as follows, for any F G B(Ai a (X)): 

i 

LF{u) = Y,\F{v + 5 Xl )-F(v)]b(x l ) 



+ y 52[F("-8* i )-F(v)] (d(x t ) + J2 a (^ x j)) (2.1) 
i=i \ j^i J 



[F{^ + ^-5 Xi )-F{v)}m{x h Xj). 
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The first term above describes the clonal reproduction at the mother's site. The second 
term describes death of an individual X{ either due to aging or competition from another 
individual Xj. And the last term describes the migration of an individual from a trait site 
Xi to a site Xj. 

By introducing a parameter K G N, we rescale the population size and competition 
kernel by K. We will show later, as K tends to infinity, one can get different large pop- 
ulation limits by various well-chosen rescaling procedures. Furthermore, the population 
process can be parameterized by another parameter e governing the rate of migration law 
m(xi,Xj) in terms of population size scaling parameter K. 

For any K G N, instead of studying the above process (z/^) t >o, it is more convenient 
to consider a sequence of rescaled measure-valued processes: 



X 



K 



1 

K 1 



K 



K 



(2.2) 



where X is a Aip(X) — valued process with the following infinitesimal generator: 



L K F(u) 



x 



F(u+^)-F(u) 



+ 



+ e 



x 



F{v- 




x JX 



K' 
F{y- 



F(v) 



K 



K' 



b(x)Kv(dx) 
d(x) + [ a(x,y)u(dy) ) Kv(dx) 
- F(u) 



x 



(2.3) 



m(x, dy)Ku(dx). 



Notice that we rescale the competition kernel a by K so that the system mathemat- 
ically makes sense when we take a large population limit. From the biological point of 
view, K can be interpreted as scaling the resource or area available. 

Let us denote by (A) the following assumptions. 

(Al): 3 b, d, a , a, such that < b(x) < b, < d(x) < d, < a < a(x,y) < 

a, and b(x) — d(x) > 0, Wx E X. 
(A2): V x,y G X, f(x,y) ■ f(y,x) < 0, where the fitness functions 
f(y,x) = b(y) — d{y) — a(y,x)n(x) and n(x) 

f(x, y) = b(x) — d{x) — a(x, y)n(y) and n(y) 



b(x)—d(x) 
a(x,x) 

b(v)-d(y) 



a(y,y) 



Notice that assumption (Al) guarantees that the process with the infinitesimal gen- 
erator (2.3) is well defined (refer to [33]). Assumptions (A2) gives the non-coexistence 



condition for any pair of distinct competing traits. 



3. Early time window on an finite trait space as K ->■ oo 

We firstly review some exsiting results for this model. Champagnat [3, Theorem 1] 
proved the following result by the time scales separation technique, which can be extended 
to a more general case in accelerated population dynamics [15] . 
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Theorem 3.1. Admit assumptions (Al) and (A2). Suppose that Xq 
^> n > as K ->■ +00 , and VC>0, 

exp{-CK} < Ke < 



1 



(L snc/i that 



(3.1) 



Then, (Xfj Ke ,t > 0) converges in the sense off.d.d. to 



Y, 



n 5 x , t = 



' ^ n(r] t )5 Vt , t > 

where the Markov jump process (i] t ,t > 0) satisfies r] = x with an infinitesimal generator: 
A V {x) = [ (<p(y) - ^x))n(x) [I{y ; x)] + m(x, dy). (3.2) 

Jx Kv) 
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FIGURE 2. Simulations of the trait substitution sequence model arising in Theorem 3.1 



Remark 3.2. • The migration time scale is of order whereas the fixation time 
scale starting from one migrant is of order In K . The population is kept monomor- 
phic on the rare migration time scale. The rare migration parameter region con- 
strained by (3.1) is denoted by the region I in Figure^ 
• As showed in Figure [S| it simulates a TSS model with trait space comprising of 
three types in the left panel while it simulates a four-type case in the right panel. 
We mark the population density of trait x , X\, x 2 , x 3 by red, blue, green and black 
colored curves respectively. Take b(x ) = 3, b(xi) = 6, b(x 2 ) = 8, 6(0:3) = 10 and 
death rates d{xi) = 0, i = 0, 1, 2, 3. Take competition kernel a = 1, migration 
kernel m = 0.5, and migration parameter e = K~ 2 , where initial population size 
K = 100. 



In [2], we firstly let K tend to infinity in (2.3) and obtain a deterministic limit. Then, 
we consider the rescaling limit of the deterministic system supported on a finite trait space 
in a slow migration limit. That is actually an extreme case where it attains the so-called 
trait substitution tree by taking a two-step limit along the marginal path (see dashed path 
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in Figure [T]). In terms of the individual-based population, it is of particular interest for 
us to give a microscopic interpretation of the TST process under apropriate constraints. 
Prior to the following theorem, we list the following assumption (B) to attain our main 
result later. 

(Bl): For any finite number of types L e N, it has a monotonously increasing fitness 
landscape: Xq -< x\ -< . . . -< Xl, where x -< X\ denotes 

f{x Q ,x 1 ) < 0, /(xi,x ) > 



(B2): Nearest-neighbor migration and competition, i.e. m(xi,Xj) = a(xi,Xj) = 

for any | % — j | > 1 . 
(B3): For any i > 2, 

V + 1 r + j, 1 r - (3.3) 



Note that assumption (B3) is not necessary for us to obtain the following theorem. 
There actually exist a variety of different possible paths to converge to the equilibrium 
configuration determined up to the ordered sequence of traits as in assumption (Bl). 
However, thanks to assumption (B3), it brings us a lot convenience to prove the theorem 
without losing intrinsic features. 

i 

We inherit some notations from [2], denote configurations by r^ L ^ := n{ x 2i)$x 2i if 

i=0 

i+i 

L = 21 and := £ n(x 2i _i)5 ;E2i _ 1 if L = 21 + 1 for any I G N U 0. 
i=l 



Theorem 3.3. Admit assumptions (Al) and B. Consider the processes (X^) t >o on the 

trait space X = 
K — > +oo ; and 



trait space X = {x , xi, . . . , x{\. Suppose that Xq = ^S xo such that ^> n > as 



1< Ke < K. (3.4) 



Then there exists a constant ti > 0, such that for any t > tz, lim Xf\ i = under 
the total variation norm. 



Remark 3.4. • We illustrate the theorem by simulations (see Figure^. We take 
all the same parameters as in Figured except replacing e = K~5 in the three-type 

3 — 

case and e = K~± in the four-type case with initial population size K = 1000. 
Obviously, they both satisfy conditions (]3.4|). 



The parameter region for rare migration constrained by (3.4) is denoted by the 
upper right region II in Figure\l\ As analyzed in Theorem 3.3 , the fixation time 
scale is of order ln^. The stable configuration for the three-type case is = 
35 xo + 85 X2 and it is = 65 Xl + 105 X3 for the four-type case. We will show 
in Theorem 4J3 that the TST process jumps from to on an even rarer 
mutation time scale of order (see Figure^^. 
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FIGURE 3. Simulations of a trait substitution tree model arising in Theorem 3.3 
three- and four-type trait space. 



4. Late time window with mutations as K ->■ oo 

Following the procedures we build up in [2J, in order to study the evolutionary behav- 
ior driven by new input on a much longer time scale, we introduce a mutation mechanism 



into the population generated by (2.3). We now study the model with mutations formu- 



lated by the following generator supported on a compact set X: 



T F(u) 



x 



x L 



F{u + 
F{v - 



fi_X_, 

K' 

fix \ 

— 

K> 



F(u) 



-F(v) 



+ e 




x J x 




'X 



F(v + 



8 



x+h ■• 

IT- 



b(x)Ku(dx) 
d(x) + [ a{x,y)u{dy) S j Ku(dx) 

m(x, dy)l {yesupp{u}} Kv(dx) 
/i(x)p(x, dh)Kv(dx). 



(4.1) 



F(u) 



Here we denote the process by X K,e,cr with one more superscript a, distinguishing 
from the one without mutation in the previous section. 

Notice that the mutation kernel p(x, dh) is used to introduce a new trait site to 
the previous finite trait space and enlarge the supporting trait space by one each time 
there enters a mutant, whereas the migration kernel only acts on current support sites 
of the population. Later we will see, under some rare mutation constraint (with respect 
to migration rate), the dominating power for fixation is mainly from exponential growth 
of intial migration particles. Before proceeding towards the main theorem, we now give 
some assumptions and the definition of the trait substitution tree, which appeared in [2j. 

Assumption (C). 



(CI): For any given distinct traits {xq,Xi 
order permutation 
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■ , x n } C X, n G N, there exists a total 



Xn.Q ~^ X ni -"\ • ■ • Xn n _ i ~* ^ 



(4.2) 



where x -< y means that the fitness functions satisfy f(x,y) = b(x) — d(x) 
a(x,y)n(y) < 0, and f(y,x) = b(y) - d{y) - a(y,x)n(x) > 0. 



» 



» 



-< 



-< Xn with 



For simplicity of notation, we always assume x" ~' -< x l 
x^ = x n . for < i < n. By adding a new trait x whose fitness is between 



and Xj+i for some < j < n, we relabel new traits as following 



where x[ n+1 ^ = x\ a> for < i < j, Xj^ } = x and xl' 1 ^ 1 ' = x\\ for j+2 < i < n+l. 
(C2): Competition and migration only occurs between nearest neighbors, i.e. for 
totally ordered traits in (CI), we have m(x\ n \ x^) = a{x^\ Xa ) = for | i — j |> 



x 



» 



(n+l) 



-< X 



(n+l) 



-< X 



(n+l) 



-< X 



(n+l) 
n+l > 



(4.3) 



>+i) 



» 



1. 



Under above assumptions we can rewrite the generator (4.1) as following 

5 T 



L K ^F(v 



x 



F(v+^)-F(v) 



b(x)Kv(dx) 



+ 
+ e 



x 



F(v 



K' 



F(u) 



d(x) + 



a: 



x 



[x,y)l{ x - iXtX + } u(dy) Kv{dx) 




x J x 



5,, 8. 



(4.4) 




F(u+ 6 -^)-F(u) 



±{x-,x+}m(x, dy)Kv(dx) 
fj,(x)p(x, dtyKv^dx) 



where x and x + , specified by the total order relation in assumption (CI), are elements 
in supp{z/} C X satisfying 



and 



x = sup{y G supp{z/} : f(y, x) < 0} 

x + = mi{y G supp{z/} : f(y,x) > 0}. 

On the migration time scale, there are a variety of different paths to approach the 
equilibrium configuration by specifying different coefficients. However, the equilibrium 
configuration of a finite trait system is always the same up to the ordered sequence 
determined as in assumption ( CI) , and the time scale for convergence is always of order 
0(ln i) as showed in Theorem |3.3[ 

Definition 4.1. A Markov jump process {T t : t > 0} characterized as following is called 
a trait substitution tree (in short TST) with the ancestor Tq = n(xo)S Xo . 



(i): For any nonnegative integer I, it jumps from := ^ 



i=0 



nix 



(21) 
2i 



)5m ) to r( 2 ^) 



with transition rate n(x^)n{x^')p(x ( 2k' , dh) for any < k < I, where 



,(2/) 



(21) 
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,(20 / (20 (21)^ 



r( 2i + i) = E , =i n{x^)b xm + n(x%> + h)5 xm+h + £< = , +1 ^(4?)^) 

2,z 1. 2 fc 2 2 

z/ t/iere exz'ste < j < I s.t. xff -< x 2 j) + h -< 4j+i> 



r( 2i + D = ^^(42:)^^ + 

z/ t/iere exz'ste < j < / s.t 4j-i 4*P + ^ -< ■ 



r (20 



Then, we relabel the traits according to the total order relation as in ( CI ): 

_(2i+l) 
"2Z+1 ' 



r^r^-^f^c (4.5) 

where in associate with the first case 

xf^ := af> /or < i < 2j, x$+? := sg? + fc, 
:= x£'J for2j + 2<i<2l + l, 
and in associate with the second case 

xf +1) := xf l) /or < z < 2j - 1, x% +1) := x 2 f + h, 
xf +1) := xfl\ for2j + l<i<2l + l. 

(ii): For any nonnegative integer I, it jumps from T^ 2l+1 ^ : = E!=i ^( x n-i' > )^ x ( 2l + 1 ) t° 
Y( 2l + 2 ) w ith transition rate n(x^^)f^(x^^)p(x^^l\ dh) for any 1 < k < I + 1, 

where 

— 1) 2fc — 1 2i 1 

r (2J+l) (21+1) , (2, 

J 2j-1 ^ x 2k-l x 2j 



z/ t/iere exz'sfo 1 < j < I + 1 s.t. x^f-i^ ^ x 2k-i^ + ^ ~< 4^ ' 



r( 2i + D = Ef=i n^-!))^ 1 ) + ESS "(^m-YVxP'+d 

2 (i — 1) 2i 1 

z/ there exists 1 < j < I + 1 s.t x 2 2 j^ -< 4^-1^ + h -< 4j+i^- 



Then, we relabel the traits according to the total order relation as in (CI): 

J2/+2) (2«+2) 
y 2Z+l ^ x 2Z+2 ' 



{21+2) (2Z+2) (2Z+2) (2Z+2) / . fi x 

x -< x x -<•••-< x 2l+1 -< x 2l+2 , l 4 -°J 



where in associate with the first case 

xf^ := xf^ for < i < 2j - 1, xg' +2 > := x^ + 
+2) := xf!t 1] for 2j + 1 < i < 21 + 2, 
and in associate with the second case 

x?+ 2) ■= x ? l+1) for < z < 2j - 2, x%+? := x ™ + 

x ? +2 > := xf!t 1] for 2j < i < 21 + 2. 
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Remark 4.2. According to the definition, the new configuration is constructed in a way 
that every alternative trait gets stabilized when one "looks down" from the most fittest 
trait along the declining fitness landscape. Once a mutant is inserted in between two levels 
(say, i and % + 1), we relabel all the traits above from the mutant's level. However, the 
mutation only alters the equilibrium configuration below i + 1th level but not above. This 
construction is similar to the look-down construction of Coalescent processes (see 



Theorem 4.3. 



t > 0} 



Admit assumption (Al) and (C). Consider the process {X t 

described by the generator (4.4). Suppose that X(f' e,<T = ^-5 XQ and > n{xo) in law 

as K oo. In addition to the condition (3.4), suppose it also holds that 



, 1 1 

In - < — 
e Ko 



< e 



KC 



for any C > 0. 



(4.7) 



Then (X^^) t >o converges as K — >■ oo to the trait substitution tree (T t )t>o defined in 
Definition 4-1 in the sense of f.d.d. on M. F [X) equipped with the topology induced by 



mappings v i— > (u, f) with f a bounded measurable function on X . 



M/iikiliiyli 



500 



1 

1000 

output. T 



1 — 

400 
output.T 



FIGURE 4. Simulations of a trait substitution tree on the mutation time scale arising 



in Theorem 4.3 on four- and five- type trait space. 



Remark 4.4. 



There are two time scales for the individual-based population, which 
can be observed from Theorem 3.3 and the generator (4.4). One is the fixation 



time scale of order In \ while the other one is the mutation time scale of order 



which are constrained on LHS of the inequality (4.7). By adopting the time scales 



separation technique used in [3], we can get a nice limiting structure-TST in the 
large population limit. The RHS of the inequality (4.7) is used to guarantee that 



system can not drift away from the TST equilibrium configuration on the mutation 
time scale (see Freidlin and Wentzell [15] ). 

As it is showed in Figure we simulate the trait substitution tree processes by 
introducing a mutation mechanism. Note that the simulation shows a special case 
where the population always reproduces a mutant which is more fitter than any of 
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already existing traits. The birth rate of red-colored population is 3, while the blue 
one, the green one, the black one and the yellow one have birth rates 6, 8, 10, 12 
resp.. Their death rates are constant 0. We take e = K~ ' 8 and a = K~ 15 , where 
initial scaling parameter K = 400. On a longer mutation time scale, the fixation 
process due to migration is not visible any more. However, if we zoom into the 
infinitesimal fixation period, pictures as in Figure [3] will emerge. 



We begin by introducing some terminology from population genetics in the same 
setting as [321 Chapter 10.1]. 

In this section we restrict our attention on one-locus diploid populations, where the 
chromosomes occur in the form of homologous pairs. More precisely, an individual's 
genetic makeup with respect to a particular locus, as indicated by the unordered pair of 
alleles situated there (one on each chromosome), is referred to be its genotype. Therefore, 
if there are n possible alleles, i.e. finite allele space A n := {A±, . . . , A n }, at a given 



the genotype space by G n := {(Ai,Aj), 1 < i < j < n}. Without loss of generality one 
can take the total allele space A to be a compact subset of M. d , or simply a continuum 
interval [0, 1]. Denote the total genotype space by G. To keep consistent with notations 



where $ is symmetric for i, j, and traits (xi)i<i<h(n) are ordered according to their relative 
fitness as in assumption (CI). In accordance with notations in Section |5J we list the 
following parameters for diploid populations. For any g e G, 

• denote by B(g) := b(<&(g)) the birth rate from an individual of genotype g. 

• denote by D(g) := d($(g)) the death rate of an individual of genotype g because 
of "aging". 

• denote by a(gi, g 2 ) := a(§(gi), $(#2)) the competition kernel felt by an individual 
of genotype g\ from another of genotype g 2 . We restrict the competition acting 
on individuals of the same genotype and within the nearest-neighbors (ordered in 
terms of their fitness values). 

• denote by rh(gi,dg 2 ) the replacement law of an individual of genotype g± by an- 
other g 2 when one of allele pair of the mother individual g\ undergoes fusion with 
another gamete allele from any father to form a new type g 2 . Thus, g 2 is chosen 
to be any genotype fused in a way that one allele is from gi and the other one can 
be anyone from the whole supporting allele space. 

• denote by fi(g) := yu($(g)) the mutation rate of an individual of genotype g. 

• denote by p(A, da) the law of mutant variation between a mutant and its resident 
allele type A. 

We also limit our discussion to monoecious populations, those in which each individ- 
ual can act as either a male or a female parent. The reproductive process can be briefly 
described as follows. 



5. Application to a diploid population 




possible genotypes. We denote 
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Sexual reproduction. Sexual reproduction is a creation of a new organism by 
combining genetic gametes from two parental individuals. Suppose each individual can 
produce a large amount of germ cells, cells of the same genotype. Some germ cells split 
into two gametes (each carries one chromosome information from the homologous pair in 
the original cell), and any two gametes from different individuals form another genotype 
which is different from the parents. The above procedure is called meiosis and fusion of ga- 
metes. Note that sexual reproduction can be realized in a manner of gamete replacement 
of one of the mother individual's gamete. For instance, the effect of reprodcution from a 
(mother) individual of genotype (Ai, Aj) by fusing with any gamete Ak generated from 
any other (father) individual is a replacement of the individual (Ai, Aj) by a new individ- 
ual of genotype either (Ai,Ak) or (A^,Aj). We assume that each father individual can 
generate enough amount of germ cells to provide gametes for the replacement procedure 
of mother individuals. The size of offspring individuals resulted from sexual reproduction 
only depends on the size of replaced mother individuals. Based on the above simplicity, 
one can think of sexual reproduction as a spatial migration of an individual from a geno- 
typic site (A i: Aj) (say g±) to another genotypic site (Ai,Ak) or (Ak,Aj) (say #2) with 
migration kernel rh(gi,dg2), and the migration rate only depends on the size density of 
the (mother) genotype (Ai, Aj). Then we can employ the results on the spatial migration 
model in Section [3] and Section |4j which follows later. Here the sexual reproduction is 
based on allelic level but is represented on genotypic level. 

Clonal birth and death. Besides the sexual reproduction, meanwhile, there are 
also reproductive birth which does not apply fusion of gametes from two parents. Instead, 
an offspring carries a clonal copy of the parent's genotype g with birth rate B(g). This 
kind of local birth is carried out as defined in Section [2] Similarly as before, the death of 
an individual is governed by a quadratic form of its own size density and the density of 
its nearest-neighbors. These two events are based on genotypic level. 

Mutation. Mutation occurs due to a change of one of alleles in a genotype, and 
the allele space is enlarged by one. Suppose there are n different allele types before a 
mutation event. After mutation, the genotype space is enlarged to be of size h(n + 1) 
from h(n). The mutation event is governed by a mutation kernel p(A,da) on the allele 
space A. The mutation event is based on allelic level. 

Notice that all the above events are density dependent, which means their transition 
rates are proportional to the local density of the population size. We consider it as finite 
measure- valued processes ($t)t>o on the genotype space G. Denote by Mf(G) the totality 
of finite measures on G. Now we write down the infinitesimal generator of the diploid 
model with 1 / if-scaled weight for a proper testing function F 
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C K F(Q) 



+ 



+ e 



G 



G 



F(g + ^)-F(g) 
5 n 




G JG 




G J A 



F(9+"f-^)-F( g ) 
'f( 5 + 6 -^f^) - F(g) 



B{g)Kg{dg) 
D(g) + [^a(g,g)g(dg)^j Kg(dg) 

rh(9i, dg 2 )l{g 2 : gi ng2^0;g2£su PP { s }}Kg(dg 1 



6 g ((A u A 2 ))fi(g)p(A 1 , da)Kg{dg) 



(5.2) 

where g\ fl g 2 7^ means that genotype gi and g 2 share at least one same allelic type 
in their allele pairs. 

Definition 5.1. A Markov jump process (A t ) t > with initial A = UqSiai^i) f or some 
A\ G A is called a genotype substitution tree if it satisfies the f ollow s. For any given 
n G N and A n = {Ai, A 2 , . . . , A n }, let T^ n '' ( defined in Definition 4-1) be the equilibrium 
configuration for traits distribution determined by genotypes in G n = {(A4, Aj) : 1 < i < 



j < n} and (5.1). Let be 's equivalent form as the corresponding equilibrium 



on the genotype space. Suppose that the number h(n) is even, the transition rate for the 
Markov jump process (A.) from to A (n+1 ) due to mutation variation of magnitude a 
on an allele Aj (1 < j < n) is 

h(n)/2 

^2 n(x 2i )fi(x 2i )(l + 5 i (A;))l{ a . 2<= $((^. )J 4 fc )) i 3i< fe < n }p(A i ,da) (5.3) 



i=l 



where {xi)i<i<h(n) are indexed according to a fitness increasing order of {$((Aj, Aj)) : 1 < 
i < j < n}. 

Note that (1 + Sj(k)) appeared in above rate is used to distinguish cases whether the 
mutant allele is from a homozygous genotype or not. 

Theorem 5.2. Consider a sequence of processes {(jjf )t>o,K G N} defined by above gen- 
erator with initial condition q$ = n^^Ax^), where Uk converges to n as K — >■ 00. // 



K 



mutation and migration rates satisfy constraints (3.4) and (4.7), as K — >■ 00, 

converges to a genotype substitution tree process (A t ) in the sense of f.d.d. on Aip(G) 
equipped with the topology induced by mappings g h-> (g, /($(•))) with f a bounded mea- 
surable function on X . 



The above theorem can be obtained as a corollary of Theorem 4.3 

6. Outline of proofs 

In order to illustrate the basis idea of proofs, we start with a three-trait toy model. 
But notice that our analysis is not reduced only to the three-trait case. All the machinery 
is still available for any finite-trait space, which will be shown later. However, the explicit 
proofs are more difficult to write down without some restrictive conditions. That is why 



we impose assumption (B3) in Theorem 3.3 
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Proposition 6.1. Admit the same condition as in Theorem 3.3 . Consider a sequence of 
processes on a trait space X = {x , ^2}- Then, there exists a constant ti > 0, such 
that for any t >t 2 



lim X. 



under the total variation norm. 
Proof, (see Figure [5]). 



Let (x ) := \ and g( Xi ) := ^- = (Xf , for i = 1,2. 

density 



r (2) 



n(xo) 



n(cci) 



71(2:2) 



n(xo) 



n(x 2 ) 



a 2 o 2 



o'l 
°0 



time 



growth of re! 



(6.1) 



growth of x 2 recovery of * 

FIGURE 5. Phase evolution of mass bars in early time window on the three-trait site space 



Step 1. Firstly, consider the emergence and growth of population at trait site x\. 

Set S{ = inf{t > : £f(xi) — e }- Thanks to ^8 > n > in law as K — > 00 and by 

applying the law of large numbers of random processes (see Chap. 11, Ethier and Kurtz 
1986), one obtains from the last term in generator ( |2.3 ) that, for any S > 0, T > 0, 

ef(^i) 



lim P I sup 

A'^oo \0<t<T 



< 5 



where n t {xx) is governed by equation n(xi) = m(x ,Xi)n with initial n (xi) = 0. There- 
fore, 

lim P ( . 1 . 5 < S{ < . 1 . + 5 ) = 1, (6.2) 

k^oo \m(xo,Xi)no m(xo,Xi)n J 

that is, S{ is of order 1. 

For any r) > 0, set = inf{t : t > S|, (,f(xi) > 77}. Consider a sequence of rescaled 



processes I 



A'; 



i>5| 



With -|l = 3 

Ke e 



— > 1 as K — > 00. As before, by law of large 



numbers of random processes (see Chap. 11 Ethier and Kurtz 1986), one obtains, for any 
5>0, T>0, 

T K,1 



lim P sup 



if— >oo 



0<t<T 



Ke 



m t 



< 5 



1- 



(6.3) 



TRAIT SUBSTITUTION TREE 



17 



where m t is governed by equation m = f(x\,xo)m = (b(xi) — d(xi) — a(xi, Xo)n(xo)) m 
with mo = 1. 

Set T? /e = M{t - S{:t> S{,^-> rj/e}, and tf e = M{t > : m t > ry/e}. Then, 
for any 5 > 0, there exists 5' > such that 

lim P ( ( 1 — - - 5 ) In - < 5? - < f 1 — - + 5 ) In - 

k^oo \\f(xi,xo) J e \f(xi,x ) J e 



lim P ( ( 1 - *) In - < I? /e < ( 1 + *) In - 
limpf (*—-*) \n l -<tt<( - f 1 , +-1 In-. 



(6.4) 



sup I — m t I < 

o<t<q /c Ke 

--1 



where the last equal sign is due to (6.3). 



After population of trait X\ reaches some 77 threshold, the dynamics (£f(xo),£f(xi)) 
can be approximated by the solution of a two-dimensional Lotka-Volterra equations. 
Then, it takes time of order 1 (mark this time coordinator by S^) for the two subpopu- 
lations switching their mass distribution and gets attracted into 77— neighborhood of the 
stable equilibrium (0, n(x\j). 

Step 2. Now consider the emerging and growth of population ^(x 2 ) : = (X^, l{ x - 2 }) 
at trait site x 2 . Set S 2 = inf{t : t > S^, ^(x 2 ) > e}. Similarly as is done for S{ in 



(6.2), one can get that lim F(S 2 — S± = 0(1)) = 1. On a longer time scale, we will not 

' K— >oo 



distinguish S 2 from S{ 

Set S 2 = inf{t : t > S 2 , ^(x 2 ) > 77}. One follows the same procedure to derive (6.4) 
and asserts that for any 5 > 0, 

lim P ( ( T — 1 — - - s] In- < S n 2 - S r > < (-=-; — - — - + 5) In - ) = 1. (6.5) 



K ->°° \\f(x 2 ,xi) ) e f(x 2 ,xi) e 

Note that assumption (B3) i > i yX2 yLd{x 2 ) — J{x\ x ) JixtxT) g uaran ^ ees that ^{x 2 ) can not 
grow so fast in exponential rate b(x 2 ) — d(x 2 ) such that it reaches some 77-level before S 2 - 
During time period (S^, S 2 ), population at site xq, on one hand, decreases due to the 
competition from more fitter trait x\. On the other hand, it can not go below e level due 
to the successive migration in a portion of e from site x\. More precisely, by neglecting 
migrant contribution, ^(xq) converges n t (xo) in probability as K tends to 00, where 



h t (x ) = (b(x ) -d(x ) - a(x ,xi)ri(xi)) n t (x ) = f(xQ,xi)n t (xo) 



(6.6) 
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lim P UsJsi x o) e (n A 5j(^o) - S,n A£ m(x ) + 5) 

A— s>oo \ 2 2 
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lim P (rjeKn)^ _ 5 < £f,(:ro) < ^iWOASj + tf 



(6.7) 



ljm P ^ e l/Wi)l//Wi) _ 5 < £|(>o) < ^el/C^o^!)!//^,^!) + tf 



where the second equality is due to (6.5). Taking the migration from site X\ into account, 
we thus have 



K— >oo 



lim P Ufv(x ) = O( e l/(*°>*0l//>2,*i) v 



1. 



(6.8) 



We proceed as before for S± in step 1. After time S 2 , the mass bars on dimorphic system 
(^( x i)j h (^2)) can be approximated by ODEs and will be switched again in time of 



order 1 (marked by 5*2 as in Figure |5J), and they are attracted into rj— neighborhood of 
(0,n(x2)). As for the population density on site xq, one obtains from (6.8) 

lim fe(x ) = O(e C1 )) =1 (6.9) 

where d = A 1< 1. 

Step 3. We now consider the recovery of subpopulation at trait site xq. Recovery 
arises because of the lack of effective competitions from its neighbor site Xi, or under 
negligible competitions since the local population density on x\ is very low under the 



control of its fitter neighbor x%. Without lose of generality, we suppose c\ :- 
in ([6^81. 



< 1 



£§n(*o) 

b 2 



Set Sq = mi{t : t > S 2 , ^{xq) > 77}. We proceed as before in step 1. From (6.8), 
converges to some positive constant (say mo) in probability as K — > 00. Thus, by 



N 

applying law of large numbers to the sequence of processes for any 5 > 0, T > 

*f(*o) 



lim P sup 



A"— >oo 



0<t<T 



- m t 



< S 



1 



(6.10) 



where m t is governed by logistic equation rh = (b(xo) — d(xo)) m starting with a positive 
initial m^. 

Following the same way to obtain (6.4), time length Sq — S 2 can be approximated 
by time needed for dynamics m to approach r]/e Cl level, which is of order ^ XQ y: d ^ Xo ^ m \, 
i.e. for any 6 > 0, 



lim P 

K— >oo 



Cl 



b(x ) - d(x ) 



<) I In - < S v - SI < 



Cl 



b(x Q ) - d(x ) 



+ 6 In 



1. (6.11) 



At the same time, £f(xx) converges in probability to ip t which satisfies equation if> = 
f(xi,x 2 )4> with ipgr, = 77. Then, we can justify the following estimate for population 
density at site X\, 

l,im P = 0(6 C2 Ve)) = l (6.12) 
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where c 2 = ti S o)-dl^y 



We now combine all these estimates (6.4), (6.5), (6.11) together, and conclude that 



A'— > 

for t > i 2 :— ~T? — ~ — v + 77 — — v + t> — vh? — \ under the total variation norm II • II on M. F (X). 

z f(xi,xo) f(x 2 ,x 1 ) b(x )-d(x ) II H 

□ 



lim P (\\Xf h i - T (2) || < S) = 1 (6.13) 



Proof of Theorem 3.3\ We proceed the proof by the induction method over the superscript 



L G N of trait space X^ L > = {x , X\,..., xl} 

(1) . When L = 2, it is already proved in Proposition 6.1 that there exists a constant 
i 2 > such that for any t > i 2 

LmX^SrM (6.14) 

K— >O0 e 

under the total variation norm. 

(2) . Without loss of generality, suppose it holds that for any L = 21 there exists a 
constant t 2i such that for any t > i 2l 

BmA^ffll™. (6.15) 

We need to prove the same relation also holds for the case L — 21 + 1. 
We firstly consider the invasion time scale of population at site x 2 i+i- 
Denote by ^(a^z+i) := (Xf Afai+i})- ^ Ke 2l+1 <C 1, it follows a similar proof as 



in Proposition 6.1 So, now we only need to consider the case when Ke 2l+1 ^> 1, that 



N K 

is, the mass at site x 2 \+\ is large in the very beginning. In fact, since -g > n in law 

as K — > oo and the nearest-neighbor mass migrates from site x to site x 2 m by passing 
through xi, . . . , x 2 i, one applies the law of large numbers for random processes and obtains 
that 

lim P ( sup |ef (x 2 i+i) - n t (x 2l+1 )\ < 5 ) = 1 (6.16) 

K^oo \0<t<T J 



where n t {x 2 i + i) satisfies the equation h t {x 2 i + i) = e 2l+l Yif=i m { x ji x j-i) n o- So, it takes 



r- 



time of order 1 for £^ (a^j+i) to reach e 2l+l level (mark the time coordinator by S^z+i)- 

Set S v 2l+l = w£{t : t > Sf z+1 ,6f (x 2 i+i) > V}- For t G {S 2l+l , S^ l+1 ) , again by law of 
large numbers, - j^+i" 1 ^ converges to <p t which satisfies 0q = 1 and 

<j> = (b(x 2l+1 ) - d(x 2l+1 ))<j>. (6.17) 

Thus, A5*2 i+1 := S 2 i + i~ S 2 i+i can be approximated by the time length (say At 2 «+i) needed 
for dynamics <fi to reach i]/e 2l+1 level, i.e. 

lim Y~~4 ,-s] \n-<AS^ l+1 <(- . + ln- 

K-^oo \\b(x 2l+1 ) - d(x 2l+1 ) J e 21+1 \b(x 2l+1 ) - d(x 2l+1 ) J e 



i. m (( 21 + 1 A, 1 . / 21 + 1 \ 1 

lim P - <5 In - < At a +i < 77 : 77 r + 5 In - 

a-^oo y V. 0(^+1) - d{x 2 i+i) J e \b{x 2 i +1 ) - d{x 2 i +1 ) J e 



1. 



(6.18) 
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We inherit the notation as the hitting time of 77- level for the population at site x 2 \. 



Due to the hypothesis for L 

\I(X21,X 2 1-1 



21 case, we know that is of order 



-1 



[/(«!, ar )] 



-1 



In 



Thanks to assumption (B3), i.e. 

21 + 1 



> 



+ . . . + 



(6.19) 



(6.20) 



b(x 2l+1 ) - d(x 2 i+i) ' f(x2i,x 2 i-i) f(xi,x ) 

it implies that before time S^, population at site x 2 i+i is still under negligible level (of 
order e c for some positive constant c) and can not influence the invasion process up to x 2 [. 



Following a similar procedure as deriving (6.19) (see Figure™ to analyze the colo- 



nization of population at site X2/+1 due to migration from site x 2 i with exponential rate 
f(x 2 i + i,x 2 i), one obtains that S^ l+1 should be of order 



{[f(x 2l+1 ,x 2 i)] 1 + . . . + [/(xi,x )] Jin 



(6.21) 



Comparing two time scale estimates (6.18) and (6.21) for S^,! under assumption 
(B3), one gets (6.21) is the right one for the fixation of population at site x 2 i+i- 



Now we consider the total recovery time by summing up recovery time of all subpopu- 
lation on every second site backwards from x 2 i + i to xq, one can do calculations repeatedly 



as in Step 3 of the proof for Proposition [ClT] More precisely, for 1 < i < I, the initial popu- 
lation ^ K {x 2 i-\) on site x 2 i-\ which is prepared for recovering is no less than e-level due to 
the consistent migration from its fitter neighbor site x 2 i ■ On the other hand, it grows expo- 
nentially at least with a rate /(a^i-i, £21-2) = b(x 2i -i) - d{x 2i -x) - a(x 2 i-i, x 2i - 2 )n(x 2i - 2 ) 
due to the possibly strongest competition from its unfit neighbor x 2 i- 2 - In all, the recov- 
ery time (mark by S^i) for the population £ K (x2i-i) to reach 77- level can be bounded 
from above 



lim P 

K— >oo 



1 



f{x 2 i-l, X 2 i_ 2 ) 



+ 5 In 



1. 



We now combine both time estimates (6.21) and (6.22). Let 

2 



21+1 



[/(zjh+i, »2i)] 1 + • • • + [f{xi,x )] 



(6.22) 



(6.23) 



Then, one can conclude that for any t > t 2t+ i, for any 5 > and < i < I, 



lim P 

K— YOO 



(6.24) 



It follows the conclusion for any t > t 2 i+i, 



lim X« 1 ( = } 

is tm- 



p(2/+l) 



(6.25) 
□ 
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Proof of Theorem \4-3\ The proof of this result is similar to the proof of [3J Theorem 1] . 
We will not repeat all the details and only focus more on supporting lemmas which are 
cornerstones of the proof. 

For any e > 0, i > 0, L 6 N, 5 C A 1 measurable, take the integer part Li := 
and denote by 



A K ' e ' a (e 



t,L,B) : = < Supp(X K ^' cr ) has L + l elements, and L\ out of them, say {x\, . . . 



< e, 1 < i < Li, 



x Ll } C B, satisfy (X^' a , l {Xz} ) - n(a:<) 

Ka 

and the other L + 1 — L\ traits, say yi, . . . , Vl-Lh satisfy 
(X^,l {yj} ) <e,l<j <L + l-L\ 

K<7 J 

(6.26) 

To the end, it is enough to establish that 

lim P (A K ' e ' a {e, t, L, B)) = P (Supp(r 4 ) C B and has L x elements ) (6.27) 

A"-»oo 



where (r t ) t > is defined in Definition 4.1 



The first key ingredient of the proof is the characterization of exponentially dis- 
tributed waiting time of each mutation event. It can be proved from the expression of the 
generator (4.4) as done in [31 Lemma 2 (c)]. We will not show the details here. 

h.,.o _ p(i)^ w o J ^ take L = 21. Let r be the first mutation 



Lemma 6.2. Assume that XQ' e ' a 



time after 0. Then, 



lim P r > 



K— >oo 



Ka 



K^oo 



lim P I at time r, mutant comes from trait x 



exp ( -t^™(4?V(4f; 



i=0 



(21) 
2k 



eS=o 



(20 s 
2i i 



(6.28) 



(6.29) 



The second ingredient can been seen as a corollary of Theorem 3.3 It demonstrates 
that fixation of new configuration takes time of order ln^, which is invisible on the 
mutation time scale. 



Lemma 6.3. Assume that X^' e ' a 
exists a constant C > 0, for any 5 > 0, such that 

1 



p(22) _|_ i ,5 , „ f or some o < k < I. Then there 

K x\.'+h J — — 



lim P I t > C In 

K— >oo 



sup 

6 te(CIni,r) 



X 



K,e,a 
t 



r ( 2/+1 )|| <5 \ =1 



(6.30) 



where T^ 2 ' +1 ^ is defined as in Definition 4-3 (i) and || • || is the total variation distance. 



Proof. From Lemma (^2, one concludes that, for any C > 0, 

limP(r £ > Cln-) = 1. 

e->-0 v e 
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According to the fitness landscape, there will be one and only one ordered position for 



the new arising trait x 2 ^ + h in T^ 2l \ Suppose there exists x 2 2l) such that x ( 2 ^ + h fits 



(20 



„(«) 



between x 2 l[ and a^i+i- Then, one has the local fitness order 



(20 



X 2j-1 ~^ X 2j 



-< x?}' + h -< x 



2k 
(21) 



(21) 
2j+V 



(6.31) 



Since it is unpopulated for both traits a^j-i anc ^ x 2j+i m we consider ( 



x 



(20 „(2Z) 



2j 



X 



2k 



' + 



/i) as an isolated pair without competition from others. As the same analysis as being 
done in Proposition 6.1, the two-type system will converge to (0, n(x 2 k + h)5 (21) ,) in 



time of order O (in jj. On the right hand side of the isolated pair, nothing changes due to 

(2/) . 

2j_i increases exponen- 



(21) 

their isolation. Whereas on the left hand side of the pair, trait x 



(21) 

tially due to the decay of its fitter neighbor x 2 j ■ So on and so forth, the mass occupation 

(21) 

flips on the left hand side of x 2 j ■ As the same arguments in the finite trait space case 



(see Proof of Theorem 3.3), the entire rearrangement process can be completed in time of 
order 0(ln -). 

In a similar method, we can prove the other case when the fitness location of x 2k + h 



is on the left hand side of xffi, that is, 



x 2j-l ^ x 2k 



+ H -4, X 2 j ~i %2j+l m 



In all, we conclude the new configuration r^ 2 ' +1 ^ by relabeling the traits as done in 

□ 



Definition 4.1 



Thus we conclude the proof of the Theorem 4.3 



□ 
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